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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/689 , 343D 



DATE: 10/22/200 
TIME: 16:29:38 
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Input Set : A:\NEB-181-2.txt 

Output Set: N:\CRF4\10222002\I689343D.raw 



<110> APPLICANT: Vaisvila, Romualdus 

Morgan, Richard D. 

Kucera, Rebecca B. 

Claus, Toby B. 

Raleigh, Elisabeth A. 
<120> TITLE OF INVENTION: Method For Cloning And 

Endonuclease 
<130> FILE REFERENCE: NEB-181 

<140> CURRENT APPLICATION NUMBER: US 09/689, 343D 

<141> CURRENT FILING DATE: 2000-10-12 

<16 0> NUMBER OF SEQ ID NOS : 9 

<170> SOFTWARE: Patentln version 3.1 

<210> SEQ ID NO: 1 

<211> LENGTH: 903 

<212> TYPE: DNA 

<213> ORGANISM: Micrococcus sp . 
<220> FEATURE: 
<221> NAME/KEY 
<222> LOCATION 



2 Q 9 



Producing The Msel Restriction 



T 



CDS 
(!)■ 



(900) 



<223> OTHER INFORMATION: 
<400> 1 



33 


atg 


cct 


ate 


teg 


ace 


gtc 


tgg 


acg 


ccg 


gac 


gga 


gac 


gac 


etc 


ate 


gtg 


48 


34 


Met 


Pro 


He 


Ser 


Thr 


Val 


Trp 


Thr 


Pro 


Asp 


Gly 


Asp 


Asp 


Leu 


He 


Val 




35 


1 








5 










10 










15 






37 


gag 


gcg 


gac 


aac 


etc 


gat 


ttc 


att 


caa 


acg 


etc 


ccc 


gac 


gcg 


age 


ttc 


96 


38 


Glu 


Ala 


Asp 


Asn 


Leu 


Asp 


Phe 


He 


Gin 


Thr 


Leu 


Pro 


Asp 


Ala 


Ser 


Phe 




39 








20 










25 










30 








41 


cga 


atg 


ate 


tac 


ate 


gat 


ccg 


ccg 


ttc 


aac 


aca 


ggg 


cga 


acg 


cag 


egg 


144 


42 


Arg 


Met 


He 


Tyr 


He 


Asp 


Pro 


Pro 


Phe 


Asn 


Thr 


Gly 


Arg 


Thr 


Gin 


Arg 




43 






35 










40 










45 










45 


ctt 


cag 


teg 


etc 


aag 


acg 


ace 


cgc 


teg 


gtc 


aca 


ggg 


teg 


cga 


gtc 


ggc 


192 


46 


Leu 


Gin 


Ser 


Leu 


Lys 


Thr 


Thr 


Arg 


Ser 


Val 


Thr 


Gly 


Ser 


Arg 


Val 


Gly 




47 




50 










55 










60 












49 


ttc 


aaa 


ggc 


cag 


acg 


tac 


gac 


acg 


gtc 


aag 


age 


act 


ctg 


cac 


teg 


tat 


240 


50 


Phe 


Lys 


Gly 


Gin 


Thr 


Tyr 


Asp 


Thr 


Val 


Lys 


Ser 


Thr 


Leu 


His 


Ser 


Tyr 




51 


65 










70 










75 










80 




53 


gac 


gac 


get 


ttc 


ace 


gac 


tat 


tgg 


teg 


ttc 


etc 


gaa 


ccg 


cgt 


etc 


ctg 


288 


54 


Asp 


Asp 


Ala 


Phe 


Thr 


Asp 


Tyr 


Trp 


Ser 


Phe 


Leu 


Glu 


Pro 


Arg 


Leu 


Leu 




55 










85 










90 










95 






59 


gag 


get 


tgg 


egg 


ttg 


etc 


ace 


cct 


gac 


ggc 


gcg 


etc 


tat 


ctt 


cat 


ctg 


336 


60 


Glu 


Ala 


Trp 


Arg 


Leu 


Leu 


Thr 


Pro 


Asp 


Gly 


Ala 


Leu 


Tyr 


Leu 


His 


Leu 




61 








100 










105 










110 








63 


gat 


tac 


cgc 


gag 


gtt 


cac 


tac 


gec 


aag 


gtc 


gtc 


etc 


gac 


gcg 


atg 


ttc 


384 
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64 Asp Tyr Arg Glu Val His Tyr Ala Lys Val Val Leu Asp Ala Met Phe 

65 115 120 125 

67 gga cgc gaa age ttc ctg aac gag ctg ate tgg gcg tac gac tac ggc 4 32 

68 Gly Arg Glu Ser Phe Leu Asn Glu Leu lie Trp Ala Tyr Asp Tyr Gly 

69 * 130 135 140 

71 gcg cgc teg aag age aag tgg ccc acc aag cac gac aac ate etc gtg 4 80 

72 Ala Arg Ser Lys Ser Lys Trp Pro Thr Lys His Asp Asn lie Leu Val 

73 145 150 155 160 

75 tat gtg aag gac ccg aac aac tac gtc tgg aac ggt cag gat gta gat 528 

76 Tyr Val Lys Asp Pro Asn Asn Tyr Val Trp Asn Gly Gin Asp Val Asp 

77 165 170 175 

79 cgc gag ccc tac atg gcg ccc ggg etc gtt aca ccc gag aag gta gcg 576 

80 Arg Glu Pro Tyr Met Ala Pro Gly Leu Val Thr Pro Glu Lys Val Ala 

81 180 185 190 

83 ctt ggc aag ctg ccc acc gac gtc tgg tgg cac aca ate gtt ccg cct 624 

84 Leu Gly Lys Leu Pro Thr Asp Val Trp Trp His Thr lie Val Pro Pro 

85 195 200 205 

87 gcg age aaa gag cgc acc ggg tac gcg aca cag aag ccg gtc ggc ate 672 

88 Ala Ser Lys Glu Arg Thr Gly Tyr Ala Thr Gin Lys Pro Val Gly He 

89 210 215 220 

91 ate cgt cgc atg att cag gcg age age aat gaa ggc gac tgg gtt ctg 720 

92 He Arg Arg Met He Gin Ala Ser Ser Asn Glu Gly Asp Trp Val Leu 

93 225 230 235 240 

95 gat ttc ttc get ggt agt ggg acg acc ggc gec gcg gec cgc cag etc 768 

96 Asp Phe Phe Ala Gly Ser Gly Thr Thr Gly Ala Ala Ala Arg Gin Leu 

97 245 250 255 

99 gga cgc cgt ttt gtg etc gta gac gtc aac cca gaa gca ate gcg gta 816 

100 Gly Arg Arg Phe Val Leu Val Asp Val Asn Pro Glu Ala He Ala Val 

101 260 265 270 

103 atg gca aaa egg ttg gat gac ggg gca ttg gac acc age gtg acg ate 864 

104 Met Ala Lys Arg Leu Asp Asp Gly Ala Leu Asp Thr Ser Val Thr He 

105 275 280 285 

107 gtg cag act ccc cag agt gac cca cga acc gac gga tga 903 

108 Val Gin Thr Pro Gin Ser Asp Pro Arg Thr Asp Gly 

109 290 295 300 

111 <210> SEQ ID NO: 2 

112 <211> LENGTH: 300 

113 <212> TYPE: PRT 

114 <213> ORGANISM: Micrococcus sp . 
116 <400> SEQUENCE: 2 

118 Met Pro He Ser Thr Val Trp Thr Pro Asp Gly Asp Asp Leu He Val 

119 15 10 15 

121 Glu Ala Asp Asn Leu Asp Phe He Gin Thr Leu Pro Asp Ala Ser Phe 

122 20 25 30 

124 Arg Met He Tyr He Asp Pro Pro Phe Asn Thr Gly Arg Thr Gin Arg 

125 35 40 45 

127 Leu Gin Ser Leu Lys Thr Thr Arg Ser Val Thr Gly Ser Arg Val Gly 

128 50 55 60 

130 Phe Lys Gly Gin Thr Tyr Asp Thr Val Lys Ser Thr Leu His Ser Tyr 
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1 ^1 


65 








70 










75 










80 


1 Ol 

IJJ 


Asp 


Asp Ala Phe 


X Hi 


nop 


xyr 


xrp 


Ser 


Phe 


Leu 


Glu 


Pro 


rtx y 


Leu 


Leu 


1 








ft S 










90 










95 




1 7fi 
IjD 


Glu 


Ala Trp Arg 


JjCU 


Leu 


Thr 


Pro 


Asp 


Glv 


Ala 


Leu 


Tvr 


Leu 


His 


Leu 


1 77 






100 










105 










110 






1 7Q 

J. J -3 


Asp 


Tyr Arg Glu 


Va 1 

V CLX 




x yi 


Ala 


Lys 


Val 


Val 


Leu 


Asp 


Ala 


Met 


Phe 


1 4.0 






115 








120 










125 








1 49 


Gly Arg Glu Ser 




T.p»i i 
ucu 


Asn 


Glu 


Leu 


lie 




Ala 


Tvr 


Asp 


Tvr 


Glv 

\j j_ jr 


1 4. 7 




130 






1 

JL J -J 










140 












Ala 


Arg Ser Lys 


Ser 


Lys 


rp -»~ r> 

irp 


JTX \J 


rp V| -p 
J. 11X 




Hi <? 




As n 


lie 


Leu 


Val 


1 A 
Xft D 


145 








1 RO 










155 










160 


1 Aft 


Tyr 


Val Lys Asp 


Pro 


Asn 


As n 


xyr 


V CLX 


xxp 


A en 
AO 11 


Gly 


Gin 


Asp 


Val 


Asp 


1 A Q 








1 fx R 










170 










175 






Arg 


Glu Pro Tyr 


wet 


r\XcL 


Ir X \J 


uiy 


jjeu 


V ClX 


•PVt t* 

X 11X 


Pro 


Glu 


Lys 


Val 


Ala 








180 










185 










190 








Leu 


Gly Lys Leu 


IT _L KJ 


X I1X 


Asp 


V CLX 


irp 


xrp 


His 


Thr 


lie 


Val 


Pro 


Pro 


1 jj 






195 


















205 








X D / 


Ala 


Ser Lys Glu 


Arg 


X I1X 




xyr 


rlXCL 


Thr* 

X 11X 


OX 11 


Lys 


Pro 


Val 


Gly 


lie 


1 JO 




210 






£t JL *J 










220 










i o 

JL D \J 


He 


Arg Arg Met 


He 




Ala 


Ser 


Ser 


Asn 


Glu 


Glv 


Asp 


Tro 

-t-tr 


Val 


Leu 


X D JL 


225 








230 










235 










240 


1 67 


Asp 


Phe Phe Ala 


Gly 


Ser 


Glv 


Thr 


Thr 


Glv 


Ala 


Ala 


Ala 


Arcr 


Gin 


Leu 


1 6A 
J. o ** 








245 










250 










255 




1DD 


Gly Arg Arg Phe 


Val 


Leu 


Val 


Asp 


Val 


Asn 


Pro 


Glu 


Ala 


lie 


Ala 


Val 


J. D f 






260 










265 










270 






169 


Met 


Ala Lys Arg 


Leu 


Asp 


Asp 


Gly 


Ala 


Leu 


Asp 


Thr 


Ser 


Val 


Thr 


lie 


X / \J 






275 








280 










285 








179 


Val 


Gin Thr Pro 


Gin 


Ser 


Asp 


Pro 


Arg 


Thr 


Asp 


Glv 










17 7 

x / o 




290 






295 










300 










1 7 R 


<210> 


SEQ ID NO: 


: 3 
























1 7fi 
X / O 


<211> 


LENGTH: 1236 
























1 77 

X / / 


<212> 


TYPE: DNA 


























1 7ft 
X / o 


<213> 


ORGANISM: 


Unknown 






















i fto 


<220> 


FEATURE : 


























1 ftl 
X ox 


<223> 


OTHER INFORMATION 


: Environmental DNA 












1 R7 


<220> 


FEATURE : 


























1 flA 


<221> 


NAME/KEY : 


CDS 
























185 


<222> 


LOCATION: 


(!)■ 


. . (1233) 




















186 


<223> 


OTHER INFORMATION 






















188 


<220> 


FEATURE : 


























189 


<221> 


NAME/KEY : 


misc_f eature 


















190 


<222> 


LOCATION: 


(198) . . 


(198) 




















191 


<223> 


OTHER INFORMATION: Xaa = any amino acid 










W--> 193 


<220> 




























194 


<221> 


NAME/KEY : 


mis c_f eature 


















195 


<222> 


LOCATION: 


(594) . . 


(594) 




















196 


<223> 


OTHER INFORMATION 


: N= 


G, A, C 


or T 












198 


<400> 


SEQUENCE : 


3 
























200 


atg 


cct aca ctg 


gat 


tgg 


ccc 


ggt 


aaa 


cag 


tta 


age 


ttc 


cca 


cca 


get 
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201 Met Pro Thr Leu Asp Trp Pro Gly Lys Gin Leu Ser Phe Pro Pro Ala 

202 15 10 15 

204 acc tec ttg cat ctg gag agt gtg gtc act gag gga gcg gag tea ccg 96 

205 Thr Ser Leu His Leu Glu Ser Val Val Thr Glu Gly Ala Glu Ser Pro 

206 20 25 30 

208 cct aat cgt ctg att tgg gcg gac aac ctg ccg eta atg gta gat ttg 144 

209 Pro Asn Arg Leu lie Trp Ala Asp Asn Leu Pro Leu Met Val Asp Leu 

210 35 40 45 

212 ttg gee gaa tat gaa ggg aaa ate gat ctg ate tac gec gat ccc cct 192 

213 Leu Ala Glu Tyr Glu Gly Lys lie Asp Leu lie Tyr Ala Asp Pro Pro 

214 50 55 60 

216 ttt ttt acg gat cgt act tat gcg gcg cga att ggt cat ggg gag gat 24 0 

217 Phe Phe Thr Asp Arg Thr Tyr Ala Ala Arg lie Gly His Gly Glu Asp 

218 65 70 75 80 

220 teg cgt cgt cca caa acc tgg cag ctt gca gaa gga tat acg gac gag 288 

221 Ser Arg Arg Pro Gin Thr Trp Gin Leu Ala Glu Gly Tyr Thr Asp Glu 

222 85 -90 95 

224 tgg aag gat tta gat gaa tac ctg gac ttc ctt tat cca cgc ctg gta 336 

225 Trp Lys Asp Leu Asp Glu Tyr Leu Asp Phe Leu Tyr Pro Arg Leu Val 

226 100 105 110 

229 ctg atg tat cga ctg ctg gca cca cac gga acg etc tac ttg cac ctg 384 

230 Leu Met Tyr Arg Leu Leu Ala Pro His Gly Thr Leu Tyr Leu His Leu 

231 115 120 125 

233 gac tgg cac gec aat gee tac gta cgt gta ctg ctt gat gag ate ttc 432 

234 Asp Trp His Ala Asn Ala Tyr Val Arg Val Leu Leu Asp Glu lie Phe 

235 ' 130 135 140 

237 ggg cga cag egg ttt etc aac gag ate gtc tgg ate tat cac ggc ccc 480 

238 Gly Arg Gin Arg Phe Leu Asn Glu lie Val Trp lie Tyr His Gly Pro 

239 145 150 155 160 

241 tea gec ate cga cgc gee ttc aag cgc aaa cat gat acc ate ttg gtt 528 

24 2 Ser Ala lie Arg Arg Ala Phe Lys Arg Lys His Asp Thr lie Leu Val 

243 165 170 175 

24 5 tat gtg aaa ggt gaa aac tat aca ttc aat gcg gat gcg gtt cgt caa 576 

24 6 Tyr Val Lys Gly Glu Asn Tyr Thr Phe Asn Ala Asp Ala Val Arg Gin 

247 180 185 190 

249 cct tac cat ccg age acn cat aag acc ttc get tec tec ccg aag gec 624 

250 Pro Tyr His Pro Ser Xaa His Lys Thr Phe Ala Ser Ser Pro Lys Ala 

251 195 200 205 

253 ggc ttt ggt aag gtg ccg gat ctg cag cgc ggc aaa gtg ccc gaa gac 672 

254 Gly Phe Gly Lys Val Pro Asp Leu Gin Arg Gly Lys Val Pro Glu Asp 

255 210 215 220 

257 tgg tgg tat ttt ccg gtc gtg gee cgt eta cac cga gaa egg age ggc 720 

258 Trp Trp Tyr Phe Pro Val Val Ala Arg Leu His Arg Glu Arg Ser Gly 

259 225 230 235 240 

261 tat ccg act caa aag cct caa gee ttg ctg gag egg ate ctg ctg gee 768 

262 Tyr Pro Thr Gin Lys Pro Gin Ala Leu Leu Glu Arg lie Leu Leu Ala 

263 245 250 255 

265 tec teg aac gca ggc gat ctg gtg gca gac ttc ttc tgc ggc tea ggg 816 

266 Ser Ser Asn Ala Gly Asp Leu Val Ala Asp Phe Phe Cys Gly Ser Gly 



file://C:\CRF4\Outhold\VsrI689343D.htm 



10/22/02 



Page 5 of 8 



RAW SEQUENCE LISTING DATE: 10/22/2002 
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267 260 265 270 

269 aca acc get gtg gtg gca gec cgt ctg gga egg cgc ttc ctg gtc aac 864 

270 Thr Thr Ala Val Val Ala Ala Arg Leu Gly Arg Arg Phe Leu Val Asn 

271 275 280 285 

273 gat gca age tgg cgc gec gtt cat gtg aca cgc aca cgc ttg eta cgc 912 

274 Asp Ala Ser Trp Arg Ala Val His Val Thr Arg Thr Arg Leu Leu Arg 

275 290 295 300 

277 gag gga gta agt ttc act ttt gaa cgc cag gaa act ttt act eta cct 960 

278 Glu Gly Val Ser Phe Thr Phe Glu Arg Gin Glu Thr Phe Thr Leu Pro 

279 305 310 315 320 

281 ate cag cca ctt cca cca gat tgg ttg ate ate gec gag gag cag att 1008 

282 lie Gin Pro Leu Pro Pro Asp Trp Leu lie lie Ala Glu Glu Gin lie 

283 325 330 335 

286 cgc etc caa gca ccc ttt etc gta gat ttt tgg gaa gtg gac gat caa 1056 

287 Arg Leu Gin Ala Pro Phe Leu Val Asp Phe Trp Glu Val Asp Asp Gin 

288 340 345 350 

290 tgg gat ggc aaa ate ttc cgc age cgt cat caa ggc tta cgc tec cgc 1104 

291 Trp Asp Gly Lys lie Phe Arg Ser Arg His Gin Gly Leu Arg Ser Arg 

292 355 360 365 

294 ctt cag gag cag gcg ccg etc tct eta cca ttg acc ggg aat gga ctg 1152 

295 Leu Gin Glu Gin Ala Pro Leu Ser Leu Pro Leu Thr Gly Asn Gly Leu 

296 370 375 380 

298 ttg tgt gta egg gta gtg age cgt gaa ggg gaa tac tat gag ttc aca 1200 

299 Leu Cys Val Arg Val Val Ser Arg Glu Gly Glu Tyr Tyr Glu Phe Thr 

300 385 390 395 400 

302 ggt cga gec gat age cct cac ccc gta teg ttt tga 1236 

303 Gly Arg Ala Asp Ser Pro His Pro Val Ser Phe 

304 405 410 

306 <210> SEQ ID NO: 4 

307 <211> LENGTH: 411 

308 <212> TYPE: PRT 

309 <213> ORGANISM: Unknown 

311 <220> FEATURE: 

312 <223> OTHER INFORMATION: Environmental DNA 

314 <220> FEATURE: 

315 <221> NAME/KEY: misc_feature 

316 <222> LOCATION: (198).. (198) 

317 <223> OTHER INFORMATION: Xaa = any amino acid 
319 <400> SEQUENCE: 4 

321 Met Pro Thr Leu Asp Trp Pro Gly Lys Gin Leu Ser Phe Pro Pro Ala 

322 15 10 15 

324 Thr Ser Leu His Leu Glu Ser Val Val Thr Glu Gly Ala Glu Ser Pro 

325 20 25 30 

327 Pro Asn Arg Leu lie Trp Ala Asp Asn Leu Pro Leu Met Val Asp Leu 

328 35 40 45 

330 Leu Ala Glu Tyr Glu Gly Lys lie Asp Leu lie Tyr Ala Asp Pro Pro 

331 50 55 60 

333 Phe Phe Thr Asp Arg Thr Tyr Ala Ala Arg lie Gly His Gly Glu Asp 

334 65 70 75 80 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 10/22/2002 

PATENT APPLICATION: US/09/689, 343D TIME: 16:29:39 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:3; N Pos . 594 
Seq# :3; Xaa Pos. 198 
Seq#:4; Xaa Pos. 198 



/ 
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